Fundamental bound on the reliability of quantum information transmission 
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Information theory tells us that if the rate of sending information across a noisy channel were above 
the capacity of that channel, then the transmission would necessarily be unreliable. For classical 
information sent over classical or quantum channels, one could, under certain conditions, make a 
stronger statement that the reliability of the transmission shall decay exponentially to zero with the 
number of channel uses and the proof of this statement typically relies on a certain fundamental 
bound on the reliability of the transmission. Such a statement or the bound has never been given for 
sending quantum information. We give this bound and then use it to give the first example where 
the reliability of sending quantum information at rates above the capacity decays exponentially to 
zero. We also show that our framework can be used for proving generalized bounds on the reliability. 



Capacity of a given channel is defined as the highest 
rate of sending information (measured as the amount of 
information sent per channel use) reliably in the limit 
of large number of channel uses [1-3]. Converse of the 
channel capacity theorem tells us that sending informa- 
tion at rates higher than capacity would necessarily be 
unreliable. A strong converse additionally tells us that 
the reliability would be very small and, in some cases 
more explicitly, would decay exponentially to zero with 
the number of channel uses. Not all channels have a 
strong converse [4]. 

Such strong converses are available for sending classical 
information across classical or quantum channels (under 
certain conditions) and are typically shown using a fun- 
damental bound on the reliability. But, somewhat sur- 
prisingly, there has been no such strong converse when 
quantum information is sent across a quantum channel 
and an equivalent bound has been unknown. We first 
prove this bound in full generality and then apply it to 
give the first example of a strong converse for quantum 
information transfer where the reliability decays expo- 
nentially to zero with the number of channel uses. 

Strong converse establishes capacity as a sharp thresh- 
old for information transmission and is clearly of great 
theoretical interest. It also has interesting applications 
in cryptography. Let Alice have an unlimited noise- 
free quantum memory to store qubits while Bob has a 
noisy quantum memory (also called the noisy-storage as- 
sumption) . If the strong converse holds for the quantum 
channel modelling the noise that acts on Bob's memory, 
then Alice and Bob can implement any two-party cryp- 
tographic task securely [5]. 

We now provide a more detailed but high level overview 
of our results. A protocol to transfer information (classi- 
cal or quantum) across a noisy communication channel is 
characterised by the amount of information (7?.) it con- 
veys and the reliability (F) it promises. Typical defini- 
tions of reliability ensure that F G [0,1], where F « 1 
would imply a highly reliable information transfer, i.e., 
information sent and reconstructed at the receiver are 
very close to each other (F = 1 implies an exact match) 
and F « would imply a highly unreliable transmission. 



Information could be classical or quantum. A classi- 
cal information is an unknown sequence of bits (such as 
an email message) that Alice wants to send to Bob. A 
quantum information transfer can also be looked upon as 
entanglement transfer [3]. Alice has a quantum system 
S (information) that is entangled with a reference sys- 
tem A and Alice (who doesn't have access to A) wishes 
to send a quantum system through a noisy environment 
(that doesn't act upon A) such that at the end of the 
protocol, the state of A and Bob's system (say S) is close 
to the state of A and S. 

Fundamental bound that we seek for all s £ [— /3, 0) 
and protocol parameters a is given by 



(1) 



where Eo{0,a) ~ 0, the derivative of Eo{s,a) w.r.t. s at 
s = gives us a measure of information that could be 
transferred across the channel reliably, and /3 is a con- 
stant independent of a and TZ that, for our purposes, is 
0.5. 

Eo{s,a) — sTZ is known as the Gallager's exponent 
named after R. G. Gallager who first proposed it in a dif- 
ferent setting [6]. The bound in Eq. (1) was shown when 
classical information is sent across a classical channel 
(Arimoto [7]) and quantum channel (Ogawa and Nagaoka 
[8]). Winter gave another proof of the strong converse 
for sending classical information over quantum channels 
without the Gallager's exponent [9]. Extensions of the 
above results are due to Konig and Wehner (Ref. [10]) 
and further upper bounds to fidelity for entanglement 
unassisted and assisted codes are given by Matthews and 
Wehner [11]. 

The search for quantum Gallager's exponent when 
quantum information is sent across a quantum channel 
has been a longstanding problem and we provide it in 
this paper. Table 1 lists these various cases. 

Our proof relies on using the monotonicity property 
(mentioned below) satisfied by many information diver- 
gences. The idea of proving bounds on the reliability 
for classical protocols using monotonicity dates back to 
Blahut's work [12] and has been used further more re- 
cently [13-15]. 
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TABLE I. Gallager's exponent (that gives an exponential up- 
per bound on reliability) for various cases. 



Information 


Channel 


Proposed by 


Classical 


Classical 


Arimoto (1973) 


Classical 


Quantum 


Ogawa & Nagaoka (1999) 


Quantum 


Quantum 


(this paper) 



We now provide a brief and heuristic explanation as to 
why this bound is considered fundamental. Let us define 
for a single use of channel that I{a) = dEo{s,a)/ds\s=o 
and C ~ laaxa' liot) be called the channel capacity, 
where a' is the part of a that can be changed by fine- 
tuning the protocol [1-3]. There are parameters in the 
setup that can't be changed such as the channel and there 
may be some practical constraints such as energy used 
for transmission that the protocol must obey. Since Eq 
obeys £'o(0,a) = 0, for a negative s near 0, —Ea{s,a) 
« —sl{a) < —sC and the above bound could be weak- 
ened to give F ^ e'^C'^-'^). Hence, if 7^ > C, then F is 
always exponentially bounded away from 1 . If we use the 
channel n times for sending nTZ amount of information, 
then we could, under certain conditions, write the above 
bound as F ^ gsn(K-c)^ If 7^ > C, then F ^ exponen- 
tially with n, i.e., if we are pumping information into the 
channel higher than the capacity, then the transmission 
would be quite unreliable. 

We shall frequently deal with the quantum Renyi di- 
vergences in this paper that for parameter A > are 
given by 



^a(pII^) 



1 



A- 1 



InTr/cr^" 



(2) 



where limit is taken at A = 1. We shall confine ourselves 
with As (1,2] in this paper and deal with finite dimen- 
sional quantum systems. The following two properties 
are needed later. 

Property 1: It has been shown (see Example 4.5 in 
ReL [16]) that for the chosen range of A, D\ satisfies the 
monotonicity property, i.e., for any two un- normalised 
density matrices (that are positive but need not have a 
unit trace) p, a and a completely positive and trace pre- 
serving (CPTP) quantum operation M acting on them, 
we have 



BM'y)>Dy.\M{p)\\M{a)\ 



(3) 



Property 2: We shall also need the following queer 
property that is not difficult to prove. Let IIq = |0) (0| 
and Hi = |1) (1| be two projectors with Hp + Hi = 1. 
Let a e [0,1], ^ e (0,1], p = aHo + (1 - a)Ui, 
a = /3IIq + (1//3 - P)ni, and let us define 



Dx{p\\<y). 



Note that a > but does not have unit trace. Then 
Da(q:||/3) is independent of the choice of {IIq, Hi} and 
increasing for all a > /3. 

We now derive a quantity from the Renyi divergence 

as 

Kx{A)B)p:= inf Dxip^^'Wl ® a^), (5) 

where "H^ is the Hilbert space describing quantum sys- 
tem B and S{Hb) is the set of all density matrices of "Hs, 
and 1 is the identity matrix whose dimensions should be 
clear from the context. Csiszar defined a similar quantity 
in the classical case and related it to the Gallager's expo- 
nent [17]. The following properties of K\{A)B)p would 
be useful later. 

Lemma 1. Let £^^'^ be a quantum operation and 



Proof. See Appendix. 



□ 



Lemma 2. Let p be any quantum state in AA! , and 

^J^A'^B^pAA'Y y^jg^ 



p 



Kx{A)B)p 



A 



1 - A 

where for s = — 1, 

i?o(s,AA^'^^)p :=-lnTr 
Proof. See Appendix. 



AB\ — 



s+1 



□ 



Information processing task: Suppose a quantum 
system S and a reference system A have a state \4')^^ ■ 
Alice only has access to the system S and not to A. Alice 
wants to send her part of the shared state with A to Bob 
using n independent uses of a quantum channel 
such that at the end of the communication protocol chain, 
Bob's shared state with the reference A is arbitrarily close 
to the state Alice shared with A. We shall call TZ to be 
the communication rate and is given by TZ := ln|S'|/n, 
where 1 5*1 is the dimension oi Hs- We shall assume that 
the state of S is given by l/IS*!, i.e., the completely mixed 
state. 

To this end, Alice performs an encoding 
tion given by S^"^^ to get p 
Alice transmits the system A 



r.AA'" 



opera- 



¥B" 



S^A'" 



kAS\ 



_ gS^A'- 

over N"^' 
receives the state p"^^" = 

. Bob applies a decoding op- 



AS 



eration on its part of the received state to get p 
7-s"^s|_^A'"^B" gS^A'- ^^AS^^ I The performance 

of the protocol is quantified by the fidelity given by 



(4) F{<t> 



AS „AS 



\ASpAS\ 



,AS 



If a protocol promises 
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a fidelity not smaller than F, then we shall refer to such 
a protocol as a (n, TZ,1 — ¥) code. 

The maximum rate per channel use for this protocol in 
the limit of large number of channel uses and fidelity arbi- 
trarily close to 1 was proved in a series of papers (see Refs. 
[18-25]). Let the coherent information of the channel 
j^A'^B defined as Q{N) := max^A^' I{A)B)„, where 

^AB ^ ^A'^B^pAA'^^ ^(^^5)^ H{B)„ - H{A,B),, 

and H(A)^ is the von Neumann entropy of a quantum 
state a in system A given by H{A)a = —Trcr In cr. The 
capacity of the channel is now given by the regularisation 
Qrcg(AA) := lim„^oo Q{JV^")/n. 

We now prove an inequality involving the fidelity and 
the rate. 



Theorem 1. 

isfies 

I5A(F||e 



For ¥ > e-"'^, any {n, 7^, 1 - F) code sat- 



-n'R.\ 



< Kx{A)B'' 



IP- 



Proof. Let {\i 
with 11)^^ = 



AS 



} be an orthonormal basis for Has 

AS 

. Consider a CPTP quantum map 



C AS 

2 with Kraus operators |0) (1| , 



j-AS^c ^Yiere \C\ 
and z = 2,3,...,\AS\ 

nf = 1^. Then for all , we have Tip'^'^) 



Let = O'^ and 



(1 -F')n^, T{1 
where F' = ((/)|^^ p^^ 
inequalities 



= e 

AS 



-nTl 



^^0 



nTZ 



-nTl 



)nf, 



We now have the following 



Kx{A)B'^)p> infi5A(p^^||l®(T^) 



Theorem 2. For any quantum state , s S [—1/2, 0), 
the function 



g{s) -hiTr 



TrA (cr 



1/(^+1) 



s+1 



satisfies 



ff(0) = 0, 



dg{s) 



ds 



i=0 



I{A)B), 



and g{s) + (s + 1) In |^| is an increasing function in s. 
Proof. See Appendix. □ 

We note here that only the two above mentioned prop- 
erties of the quantum Renyi divergence are used for our 
results. Hence, if the Renyi divergence is replaced by any 
other divergence that satisfies these two properties, then 
Theorem 1 shall hold for that divergence as well. The 
non-commutative hockey-stick divergence that we now 
define is one such example that for p, cr > 0, and 7 > 1 is 
given by I?(p||cr) — Tr(/9 — 70')"'", where k"*" is the positive 
part of a Hermitian matrix k = k+ — k~, k"*", k~ > 0. It 
can be regarded as a non-commutative generalisation of 
the classical /-relative entropy (see Ref. [26]) using the 
hockey stick function f{x) = [x — 7)+ [27]. We similarly 
define a derived quantity as 



inf 2?(p^^||l®(7^). 



> inf Dx 



F'n^-f^(i-F')nf|| 

= roA(F'||e-"^) 

> roA(F||e-"'^), 

where a and 6 follow from the data processing inequal- 
ity and the definition of Kx , c follows since the quantity 
Da(]F'| |e~"^) is independent of o"^, and d from the Prop- 
erty 2 of Da- □ 

The constraint F > e~"^ may not be seen as weaken- 
ing the bound because, if the constraint is violated, i.e., 
F < e^"'^, then this, by itself, would imply an exponen- 
tial convergence of F to 0. We first note that 



roA(F||e 



-nTZ\ 



> 



X 



■lnF + ?^7^ 



A - 1 

and it follows from Lemma 2 and Theorem 1 that 



(6) 



(7) 



which gives us the quantum Gallager's exponent. The 
properties of Eq are studied by the following theorem. 



QUANTUM ERASURE CHANNEL WITH 
MAXIMALLY ENTANGLED INPUTS 



We show that the fidelity would decrease exponen- 
tially with the number of channel uses for rates above 
capacity for maximally entangled inputs that have the 
full Schmidt rank. 

A quantum erasure channel transmits the input state 
with probability 1 — p and "erases" it, i.e., replaces it 
with an orthogonal erasure state with probability p [28] 
(see also Ref. [29]). The dimension of the output Hilbert 
space is one larger than that of the input. 

A quantum erasure channel A/j^ de- 
fined in Ref. [3], is given by the foUow- 

operators | V(l -p)El=i 1*)^ (^l'^ > 



ing Kraus 

\B M \A' 



pe [0,1], \B\ 



...^\ef{\A't 



1, 



\A'\ 



= \A'\ + 1, (N)"^'} , (N)'^} are orthonor- 
mal bases in T-La' and I-Lb respectively, and |e)^ = 
for j = \B\. The action of the channel can be understood 
as follows 



A'^B/ AA' 



{p^^)^{l-p)a^^+pp^®\ 
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Let a"^^ = ~^^{p'^^ ), where Q increases the dimen- 
sion but leaves the state intact. Then with probability 
1 — p, the channel leaves the state as a^^ and with prob- 
ability p, it erases the state and replaces by |e)^. It is not 
difficult to see that a"^^ is orthogonal to ® |e) (e|^. 

Taking this further for n channel uses, let a"^^ = 
(^gA'~^B^«,n(^pAA'"y rj,^^ output Can be written as the 

sum of 2" orthogonal density matrices where each of 
these matrices results from i erasures i € {0, and 
this occurs with probability (1 — The number of 

states that have suffered exactly i erasures is (") . 

Let Bi-^ ■ ■ ■ Bi^_f, be the quantum systems that have 
not suffered erasures and we could write the state in this 
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case usmg a as 



Mi,...,i„_fc 



ABi 



a 



It now follows that 
P 



i=i 



ABi^---Bi, 

il,---,in-k 



(8) 



2" terms 



where ak,n = (1 ^ p)" ^P^ ■ 

To prove the strong converse, we find an upper bound 
for K\{A)B"'). We assume that p^^" is a maximally 
entangled state with a Schmidt rank of d\ where cLa = 
\A'\. Note that this is the capacity-achieving input for 
this channel and Q{N) = (1 — 2p)+lnd^ is the single- 
letter quantum capacity for this channel [30] (see also 
Ref. [3]). Note that d!\ x p-^-^i '^'^-fc is a projector of 
rank d\ and p^i'"^-n~k \q the maximally mixed state. 



K\{A)B'^) < nEo{s)/s where we define (with some abuse 
of notation) 



£;o(s):=-ln {1 - p)d 



pd', 



and -Eo(O) = 0. Using (6), we have 

F<cxp{r^[s7^-^;o(.s)]}. 
Furthermore, for p G [0, 1/2], 

lim^ = 0(AA). 

Hence, fora^7^> 0(A/'),3sG hl/2,0)s.t. n-Eo{s)/s 
> 0, and thus the strong converse holds. For p > 1/2, 
E'q{0) < and hence, using similar arguments as above, 
for any 7?, > 0, the strong converse holds. □ 

An alternate proof of Theorem 3 using the hockey stick 
divergence is provided in the Appendix. 

To summarise our results, we have given an exponen- 
tial upper bound on the reliability of quantum informa- 
tion transmission. The bound is fundamental in the same 
vein as the bounds known for transmission of classical 
information across classical/quantum channels (see Rcfs. 
[7, 8, 10]) and holds under general conditions. We then 
apply our bound to yield the first known example for ex- 
ponential decay of reliability at rates above capacity for 
quantum information transmission. 

The authors gratefully acknowledge the comments by 
A. Winter. 



Theorem 3. The strong converse holds for the quantum 
erasure channel for the above chosen maximally entangled 
channel inputs. 

Proof. Note the following set of inequalities for s = A^^ — 

1, Ae (1,2] 



Kx{A)B")^--lnTr[TTAip 



AB' 



y 



■In 



< 



E 

2" terms 



afe.7iTr 



,.AB,,-B,^.X 
iM(CH,...,»„_, ) 



■In. 



E 



2"- terms 



■■■A' 



exp 



where a follows from Lemma 3, b follows from (8) and 
the orthogonality of C's and c follows because K\ sat- 
isfies monotonicity and Lemma 3. Using the fact that 



d-A X P 



AAi ■■■A' 



is a projector of rank d^, we get 
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Appendix 
Proof of Lemma 1 

Note that for any S > 0, there exists a such 



that Kx{A)B)p > i:>A(p^^||l (g> a^) - S. Using the 
monotonicity property from (2) in the main text, we 
have Kx{A)B)p > Dx[p^^\\l(g> £^^C{a^)] - S > 



AC I 



a'^') - S = K^^\A)C)p - 5. Since 



info-c Dx(p 

this is true for any (5 > 0, the result follows. 



Proof of Lemma 2 

The proof of Lemma 2 follows straightforwardly from 
the definition oiK,\{A)B) and from the following Lemma. 

Lemma 3 (Quantum Sibson identity). For any quantum 
state p^^ in system AB and D\ as the Renyi divergence 
of order X, we have 



= i?.(a*||a^)4 
where a* - 



A- 1 



log Tr 



Tr^(/^)' 



TrA (p 



Tr Ti-Aip^^Y 



A A 



Proof. For the classical Sibson identity, see Ref. [31]. 
Note that 



1 


A 


- 1 




1 


A 


- 1 




1 



„AB\ 



logTrTr^ (p-^^)" (a 



A- 1 



\1-X 



A 



■ log Tr 



A- 1 
= Dxia*\\aB) + 



A 



AB\ 



A- 1 



logTr 



Tr^ (p 



AB\ 



Since Dx{a*\\aB) > 0, choosing as ^ cr* gives us the 
minimum and the result follows. □ 



Proof of Theorem 2 



For any quantum state cr^^ , s e [—1/2,0), let 



g{s) := -logTr 



Tr^ (a 



AB\l/(l + «) 



s+1 



It easily follows that g{s) = 0. To show that 
dg{s)/ds\^_^ = I{A)B)cr, we use the following differenti- 
ation rule (Lemma 4 in Ref. [8]) for a Hermitian operator 
X{s) parametrized by a real parameter s 

^Trg[X{s)]^Trg'[X{s)]^. 



^AB 



Let the spectral decomposition of a be a 
^ - Xi \i) (i]"*^ and let a; = Tr^ \i) (ij"*^. Hence, we get 
(T^ = Tr^a^^ = and ki := TrA(a^S)i/(^+i) = 



It is easy to see that dni/ds = 



-K2/(s + l), where K2 = J2i \'^^ ^og{X-^^ )cTi. It now 
follows that 



dg{s) _ Trul {k2 - ki log ni) 



ds 



AjCTj) 



^ ^Tr[^A.(logA.)a, 

OS s^O L 

i 

log ( AjCr,;) 

i 

= H{B),-H{A,B)^, 
^I{A)B)„. 

We now show that g{s) + (s + 1) log j^j is an increasing 
function in s. Consider the operators Ei ~ ^(Ji/\A\. 
Then = Tr^ N) /| A| - 1. Since x\ 

7 £ (0, 1] is operator concave, we have, using the operator 
Jensen's inequality and for l/2<a</3<l,7 = a//3, 



1^1 




6 



or g{a - 1) + alog\A\ < .g(/3 - 1) + /3 log | A|. 



An Alternate Proof for Theorem 3 using the 
hockey-stick divergence 



Note that the following set of inequalities hold for the 
hockey stick divergence. 



= ^ «fc,n2?(c.,....:.„_,"iii®Cn,^,.„_:) 



2"^ terms 



2"'tcrms 
2"tcrms 



where a follows from orthogonality of C's, b follows since 
we have removed the tensors with |e) (e|, and c follows 
from monotonicity (see Lemma 4). Using the above, we 
now have 

/C(A)B") <P(p^^"||l®p^") 



< 



E 

fc=0 



n. _ I log 7 I 
2 L2 1ogci^J 



< 



E 

fc=0 



Q!fc.' 



where we have upper bounded Tr(p"^"^i ' "^"-'= — 7I (g) 
by 1 for k < n/2-[log7/(21ogdA)J. Choose 
log 7 = 7i[TZ + Q{Af)]/2 in the above equation. For 



TZ > Q{M), we have n/2- [log 7/(2 log d^)] < np. Simi- 
lar to the quantity defined in Property 2 in the main text, 
we define D. Let Hq = |0) (0| and Hi = |1) (1| be two 
projectors with Hq + Ei = 1. Let a G [0,1], 13 e (0, 1], 
p = ano + (l-a)ni, a = /JEq + (1/^ - /?)ni, and let us 
define 



D(a| 



Vip\\a) 



(9) 



Using the Chernoff bound, the inequality D(F||e "'^) > 



-,-n'R. 



76 and Theorem 1 in the main text, we get 
F<exp{-|[7e-0(AA)]} 



exp 



{-- 



[2p-l) 



41ogdA 



which gives us the strong converse. 

Monotonicity lemma 

Lemma 4. Consider the matrices p, cr > and a scalar 
7 > 0. Then for any CPTP map £ , 

Tr(p-7<T)+ >Tr[£(p)-7fM]+. 

Proof. Let the Jordan decomposition of p — 7(7 = Q — S, 
where Q, S* > 0. Let P := P{e(p)-ie(a)>o}- Then 

Tr(p - 7(t)+ = TrQ 

= Trf (Q) 

> TrP[f (Q) - £{8)] 
= Tr[£(p)-7£:(a)]+, 

where a follows since £ is trace preserving, b follows since 
we are subtracting non-negative terms. □ 



